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POSITIVELY SKEWED IF TAKEN AS "AMOUNT OF TIME PER UNIT OF 
PERFCRMANCE." THE USE OF "UNITS OF TIME PER UNIT OF SPEECH 
OUTPUT" WAS SHCWN TO BE MORE EFFECTIVE IF VERY SMALL TIME 
UNITS ARE USED TO REPORT SPEECH RATE BATA. "WORDS PER MINUTE" 
WAS DISCOUNTED AS A UNIT OF SPEECH BECAUSE WORDS VARY IN 
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/ Problems of Measuring Speech Rate 



John B. Carroll 
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{For presentation at a conference on Speech Compression, U. of Louisville, 
Louisville, Kentucky, October 19, 1966] 



Speech rate Is clearly a critical variable In speech compression, both In 
describing the Input to any speech compression system and In characterizing the 
output. Reliable and meaningful measures of speech rate are "musts" If we hope 
to appraise what a speech compression system actually does. It cannot be assumed 



that Just because a system has a certain compression ratio, such as 75% , It 
will always produce speech at a certain rate, because obviously the output rate 
will depend upon Input rate as well as the compression ratio. This fact seems 



to have been forgotten by some of the more ardent exponents of speech compression 
when they report output rates of, say, 500 wpm by merely using the compression 
ratio In their calculation, without making careful measurements of the Input 
rates. 

As I will show, measuring speech rate Is not a simple matter of counting 
words per minute, for this measure can give misleading results. 

Measurements of speech rate enter our considerations at another point. 

A tacit assumption In most speech compression work Is that this process will 
have an output that Is in some way unusual, l.e. not normally producible by the 
unaided speaker. This raises the question of what Is In fact normal, usual, • 
or typical, and also the question of where the dividing line Is between what Is 
normal, usual, and typical and what Is abnormal, unusual, or atypical. To 
answer this question we need not only reliable normative data on normal speech 
rates but also psychophysical studies of subjective responses to speech at 
various rates (Hutton, 1954) . Although the literature contains some leads on 
these matters, it Is not adequate for our purposes. 
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Purther, we need to know something about the ability of speakers to control 
their speech rates at designated levels or to vary them on demand. This Is 
undoubtedly an Important matter In connection with the production of speech 
recordings for the blind. Likewise we need to know much more about the ability 
of listeners to comprehend materials at various rates; In research on this 
matter the different procedures In the measurement of speech rate must be taken 
carefully Into account In Interpreting the results. 

In this talk, I will consider various problems of measuring speech rate as 
they pertain to what Is ordinarily called oral reading rate , that Is, the rate 
at which a speaker reads aloud a continuous prose text which he has not necessarily 
seen before. This Is In contradistinction to spontaneous speech rate> i.e. the 
rate of speaking when the speaker Is continuously composing speech of a ‘'novel” 
character. I shall not deal with spontaneous speech rate, which entails some 
very difficult problems of measurement, although some of the same considerations 
apply to It as in the case of oral reading rate. 

Let us suppose that our problem Is to determine the distribution of the 
rates of a large sample of educated adult speakers of English; If 
the problem Is stated In this way it entails practically all the problems I want 
to discuss and excludes such problems as the sampling of speakers, with which 
I am not concerned. I should note, however, that in reporting norms for reading 
rates, the sampling of the speakers must be adequately specified In terms of age, 
sex, education. Intelligence, background, and any abnormal characteristics such 
as speech handicaps, deafness, etc. 

First, we must know what we mean by rate . In the present context, rate 
must be reported according to the following model: 

X units of speech output per unit of time. 

For example, we might report words per minute, syllables per aeconH. or even 
Eages.per hour, if the pages are of some specified standard size and wordage. 
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We shall speak of the problem of units of measurement In a moment* Right now^ 

I call your attention to the fact that the model given as 

X units of speech output per unit of time 

is not the same as 

Y units of time per unit of speech output* 

This second model is exemplified by such measurements as minutes per 300 words, 
seconds per syllables, or minutes per page . To be sure, these two models are 
related to one another; by doing the appropriate arithmetic, one can be obtained 
from the other. However, reporting a rate in terms of the second model does not 
give measurements that can be as readily apprehended and handled statistically 
as by the first model* 300 syllables per minute is a more readily grasped 
measurement than the equivalent statements *'.33 minutes per 100 syllables" or 
"0.2 seconds per syllable," Furthermore, taking arithmetic averages of measure- 
ments reported in output per unit of time is legitimate, while taking arithmetic 
averages of measurements reported in time per unit of output is not. In fact, 
the two averaging procedures give different results; if the second kind of 
measurement is to be averaged at all, the harmonic mean rather than the arith- 
metic mean must be used. 

All this is Important because a surprising number of investigations have 
used improper procedures. In the literature, the most common procedure for 
measuring speech rate is to clock a speaker while he reads a standard paragraph. 
Thus, Johnson (1961) clocked the times (in seconds) to read a 300-word passage. 
There is nothing essentially wrong in this, but when he proceeded to average 
these times, using the arithmetic mean, the result was misleading since it was 
not the same as the average he would have gotten by converting the times to the 
form Words per minute. Furthermore, generally speaking, measurements taken as 
units of output per unit of time are normally distributed over persons or over 
occasions, while measurements taken as amount of time per unit of performance 
are skewed positively. 
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If very small time units are used, data must be reported to sufficient 
permit projections to larger units* For example, a measurement 
reported as ”3 syllables per second" Is probably Imprecise, unless It is 
actually "3*000 syllables per second," from which one could reliably translate 
to "180*0 syllables per minute*" 

The real bugaboo in this picture Is the question of units* Units of time, 
at least, can be specified without difficulty and can be measured to almost any 
desired degree of precision If sufficient precautions arc taken, although even 
®ttors can be made* 1 remember an occasion when one of ray assistants 
timed something with a stop\7atch graduated In lOOths of a minute, under the 
impression that It was graduated In the normal seconds and fifths of a second* 
Naturally the results were disastrous* But when we consider units of speech 
output, there are some real problems* It Is apparently a standard convention 
to report speech rates In terms of words per minute, and thus far In this talk 
I have spoken about words per minute Just In order to seem conventional* From 
the standpoint of scientific measurement, however, the word Is a very poor unit 
and I wish that we could abolish wpm measurements* It Is not a standard unit, 
for words vary In length from the simple a to the classic antidisestablishment* 
arlanlsm* Furthermore, different samples of prose vary considerably In the 
average length of their words, whether measured in terms of syllables or In 
terms of phonemes* Number of syllables per word figures as an element In various 
measures of readability* 

Let me report In this connection an experiment that Jeffrey Sampson did 
under my supervision at Michigan recently* Six passages from American fiction 
were selected, each containing between 300 and 320 words and from 10 to 15 
sentences. In most respects, the passages were homogeneous both within and among 
themselves; the one respect In which they were deliberately allowed to vary 
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was the syllable-to«word ratio (S/W). These ranged from 1*20, for a passage 
from Ernest Hemingway » to 1.73, for a passage from Henry James. TWenty-four 
speakers were asked to read these passages aloud **at a normal, comfortable rate** 
and they were timed. The order In which the passages were read was systematically 
varied according to a Latin square; the Ss were not aware that the passages 
varied In S/W ratio. The times for Individual subjects were converted Into 
words, syllables or phonemes per minute and then averaged. Table 1 and Figure 1 
show the results. It Is evident that words/mln Is systematically related to 
S/W; syllables/mln much less so, and phonemes/mln still less so. Furthermore, 
the coefficient of variation of the means over passages was greatest In the case 
of words and least In the case of phonemes; In other words, phonemes/mln. gave 
the most consistent results. Syllables do, of course, vary In length, but 
average length of syllable probably does not vary from text to text as much as 
word length. (For the six passages, phonemes/syllable ranged from 2.71 to 2.54 
and tended to be Inversely correlated with S/W ratio; evidently the syllables 
of longer words tend to be shorter than the syllables of short words.) Never- 
theless, since phonemes are difficult to count, for practical purposes I recom- 
mend the syllable as the unit of speech output In measuring speech rate . 

But because the average speech rate In terms of even syllables /minute varies 
someidiat with the nature of the text. It Is probably wise to collect norms on 
a rather wide variety of texts, before we can accept really representative 
values . 

There Is an Interesting sidelight from this study; namely that oral read- 
ing rates differed very consistently over the 24 speakers. Correlations of 
rates among the six passages ranged from .91 to .98. This means that each 
speaker maintained a very consistent rate over the six passages, and these 
rates varied widely. 
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A number of Investigators (e.g. Starkweather^ 1960; Sheam et al .. 1961) 
have used apparatus -produced units; l.e., they report the rate at which the 
Intensity of the speech dlgnal passes specified levels determined by the set- 
tings of electronic gating circuits. Tvo things, should be recognized about 
these measurements: one, they may be affected by the nature of the texts read, 

and:tif<^, they are certainly affected by the parameters of the electronic circuits. 
«U«». satiMtvaMntB should always be accompanied by equations whereby the results 
can be converted to morie standard units such as syllables. 

Let me speak more about the collection of normative data. The rate at 
which a speaker will read a text Is obviously a function of his set. We do not 
know to what extent a speaker can conselounly control his oral reading rate, but 
in collActine nornio we can distinguish at least three distinct sets that can be 
controlled by Instructions. One: we can ask a speaker to read "as fast as he 
can"; Two: we can ask a speaker to read "at a normal, comfortable rate." (This 
was the Instruction used In the data collected by Sampson.) Three: we can ask 
a speaker to read "so as to communicate"-— "as If he were reading the material 
to a friend, making sure that he gets the meaning." Schwartz (1961) demonstrated 
that speakers Instructed to "read so as to communicate" slowed down consider- 
ably from the "normal" rate; from his data I compute that the rate was on the 
average about 87% of the normal rate. (Schwartz, Incidentally, failed to report 
rates In proper form. He reported his measurements In terms of average number 
of minutes to read 5 sentences; thus, he committed three errors: (1) failure 
to specify units of speed output In useful terms; (2) reporting rate as time per 
unit of speech output; and (3) arithmetic averaging of times per unit of output.) 

There are several Instances in the literature In which speakers were re- 
ported to control their rates at designated levels, but one gets the Impression 
that this was accomplished only after considerable trial and error. Harwood's 
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speaker (1955) read at "carefully controlled" rates of 125, 150, 175, and 200 
words per minute; Golds teln"a speaker (1940) read at various rates from 100 to 
a surprising 322 wpm. Note, of course, the unfortunate use of words as the 
units in these measurements. 

I can report a bit of normative data using the "maximum" speed set and the 
"normal" speed set. Ify subjects were 13b college and graduate students, all 
native speakers of English 4 ^or a text of 125 words or 218 syllables (thus, 
syllable-to*word ratio ■ 1.74) , the rates under the "maximum" speed set were 
normally distributed with the following means and standard deviations: 





Mean 


S. D. 


HPM 


205.4 


29.6 


SPM 


358.2 


51.6 



These data can be compared with the data for passage 6 in Figure 1, since the 
syllable/word ratio was similar. For a text of 177 words or 297 syllables 
(thus, syllable-to-word ratio 1.68), the rates under the "normal" speed set 
were normally distributed with the following means and standard deviations: 





Mean 


S. D 


HPM 


172.2 


19.2 


SPM 


289.2 


32.2 



The latter data are consistent with the data reported in Figure 1 if account is 
taken of the SAf ratio. 

The literature contains a number of reports of oral reading rates (Darley, 
1940; Gibbons et al.. 1958; Peters, 1954; Henze, 1953; Starkweather and Hargreaves 
1964), but in most cases the reports are difficult to Interpret because they are 
only in terms of words per minute or because insufficient information is given 
as to the instructions to the speakers, the nature of the texts read, and other 
relevant variables . 
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I will conclude by sirring that in this brief span of time I have been unable 
to cover the literature available or to mention all the variables that appear 
to be relevant. 1 have, however, tried to Identify the main Issues that must 
be reckoned with In measuring speech rates, whether they are Inputs or outputs 
of speedi compression systems. 
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Table 1 

Data on Oral Reading Rates for Six Passages 
Varying In Syllable A^ord Ratio, by Three Methods of Measurement, 

from 24 Adult Subjects. 

Passage^ 





1 


2 


3 


4 


5 


6 


Words 


302 


319 


305 


319 


318 


313 


Syllables 


362 


411 


418 


475 


512 


542 


Phonemes 


980 


1071 


1108 


1205 


1315 


1380 


Syll./Wd. 


1.20 


1.29 


1.37 


1.49 


1.61 


1.73 


Phon./Syll. 


2.71 


2.61 


2.65 


2.54 


2.56 


2.54 



Oral Reading Rates 



Means 

over 

Passages 



Words /Min. 



Syll./Mln. 



Phonemes /Min. 



M 


221.75 


224.47 


191.76 


187.16 


174.38 


169.85 


194.90 


<3^ 


30.71 


26.63 


24.35 


21.92 


21.40 


21.69 


21.22 


M/M 


1.14 


1.15 


.98 


.96 


.89 


.87 


C.V. ■ 


<r/M 


.16 


.14 


.12 


.11 


.11 


.11 


.109 


M 


265.81 


289.22 


262.80 


278.64 


280.77 


294.12 


278.56 




36,82 


34.31 


38.38 


32.60 


34.46 


37.56 


11.34 


M/i? 


.95 


1.04 


.94 


1.00 


1.01 


1.06 


C.V. » 


<^/M 


.13 


.12 


.12 


.12 


.12 


.13 


.041 


M 


719.60 


753.64 


696.60 


706.89 


721.09 


748.61 


724.41 


C 


99.67 


89.40 


88.49 


82.73 


88.48 


95.64 


20.28 


M/M 


.99 


1.04 


.96 


.98 


1.00 


1.03 


C.V. • 


tf/M 


.14 


.12 


.12 


.11 


.12 


.13 


.028 



The passages were slightly edited versions of selections as follows: 

1. Ernest Hemingway, The Old Man and the Sea . (1952) 

2. Sherwood Anderson, Wlnesbure. Ohio . (1919) 

3. P. Scott Fitzgerald, Tender Is the Nleht . (1933) 

4. Thomas Wolfe, Look Homewards Any^ el. (1929) 

5. J. D. Salinger, Franny and Zooev . (1955) 

6. Henry James, Daisy Miller . (1878) 
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